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ABSTRACT 

High  speed  processing  is  the  need  of  industries  and  so 
industry  after  industry  shifts  to  the  Digitization  .  This 
Digital  Economy  gives  a  new  look  to  the  industries  by 
massive  changes.  The  aim  of  this  paper  is  to  give  a 
overview  on  CAP  in  GDB  and  ACID  properties  in 
RDBMS  and  then  why  both  developers  and  enterprises 
typically  aim  at  providing  simple,  yet  very  efficient, 
solutions  for  specific  problems  to  speed,  data  flexibility 
and  unprecedented  levels  of  scale.  Next  focus  will 
towards  what  are  problems  while  migration  from  SQL 
to  NoSQL. 
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Introduction: 

As  the  digitized  era  More  and  more  peoples  are  doing 
more  and  more  online,  whether  at  home,  at  work,  or  on 
the  go.  Everything  like  from  payment  of  hand  to  hand 
shopping,  online  reservations,  online  shopping  for 
groceries  -  the  list  goes  on  and  on..  This  online  trend 
addresses  the  few  Technical  challenges  as:  Scaling, 
Performance,  maintenance  of  the  Databases.  From  the 
beginning  of  20th  century  companies  faces  question 
for  the  data  storage.  Before  this  Relational  database 
was  in  use  and  was  not  a  issue  of  Digitized  data.  With 
facing  this  problem  of  big  data  few  companies 
introduced  a  alternative  databases  for  market 
application.  They  are  concern  about  “one  size  fits  all” 
data-stores.  Storage  problem  get  desolve  partial 
manner.  These  questions  were  just  because  of  problem 
of  data  storage.  According  to  Jon  Travis,  people  are 
looking  at  other  technologies  because  the  volume  of 
data  is  getting  so  huge  day  by  day.  Jon  Travis  is  an 
engineer  at  Spring  Source  [1].  And  hence  researchers 
were  in  search  of  magical  database  which  will  be  able 


to  solve  the  big  data  problem  and  also  can  provide  the 
solutions  for  the  most  Big  Data  requirements  for  the 
huge  volume  of  data  and  velocity  of  request. 

In  order  to  overcome  problem  of  scalability  variety  of 
non-relational  structural  solutions  have  been 
introduced  ,  such  as  Dynamo  by  Amazon  ,  BigTables 
by  Google.  In  2006  and  2007,  Google  and  Amezon 
give  the  research  papers  which  explain  the  need  of  new 
featured  database  with  agility  and  high  scalability.  Also 
there  was  a  need  of  such  tool  or  services  which  able  to 
support  an  ever  increasing  number  of  users  and  data 
[3], 

Big  Data  is  Form  with  3Vs  namely  Variety,  Volume 
and  Velocity.  In  [2]  researcher  defines  the  term  Veriety. 
According  to  him  variety  belongs  to  the  various  data 
formats  of  the  data  being  generated  and  stored  and 
volume  is  amount  of  data  and  velocity  for  the  rate  at 
which  this  data  is  produced  [2],  Hence  the  need  of 
strong  foundation  of  any  real-time  big  data  architecture 
introduced.  A  non  relational  set  of  architecture  termed 
as  NoSQL(Not  only  SQL). 

The  term  “NoSQL”  first  applied  by  Carlo  Strozzi  in 
non  -structural  language,  though  it  was  based  on  a 
relational  mode.  Use  of  NoSQL  gives  a  fast,  portable 
and  relational  database  management  system  with  huge 
Scalability.  [8].  Now  a  days  the  term  “NoSQL”  is 
oftenly  use  by  breaking  word  “NoSQL”  as  “Not  only 
SQL”  and  consider  backbone  for  non-relational  storage 
systems.  Most  of  the  noSQL  systems  are  developed 
specifically  to  address  scalability  and  availability. 

Literature  Review: 

Around  1972,  Donald  D.  Chamberlin  and  Ray  Boyce 
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who  was  the  members  of  reputed  IBM  group  and  at 
that  time  Ted  Codd  was  a  computer  scientist  at  IBM’s 
San  Jose  Research  Laboratory  who  was  proposing  a 
new  way  of  organization  for  data  that  he  called  the 
“relational  data  model”  for  the  persistent  data.  He 
named  This  Query  Language  SEQUEL  (Structured 
English  QUEry  Language). In  1976-77  next  reluctant 
version  was  quickly  named  SEQUEL/2, but  had  to  be 
renamed  to  SQL  due  to  legal  reasons[5]. 

In  the  era  of  mainframes  and  business  applications 
many  developers  use  a  mature  query  technology  that  is 
SQL.  From  the  beginning  RDBMS  were  very  popular. 
In  fact,  in  1979  the  first  commercial  implementation 
was  released  by  a  small  and  unknown  software 
company  named  “Relational  Software”.  This  unknown 
software  company  now  becomes  a  popular  as  Oracle. 
Johannes  Zollmann  described  in  his  chapter  about  the 
ACID  properties  of  the  RDBMS.  He  also  notes  that 
ACID  properties  give  strong  guarantees  on  consistency 

[9]- 

Though  ACID  makes  a  strong  to  relational  databases 
some  drawback  of  type  is,  These  databases  were 
works  with  single  server  that  means  way  of  enhancing 
the  capacity  is  only  upgrading  the  server.  Another 
researcher  Antro  Salminen  note  that  in  his  seminar  that 
for  scale-up  the  RDBMS  only  way  is  by  adding  the 
Hardware  processing  power[6]  RDBMS  also  has 
physical  storage  limit[3]. Hence  Such  database  was 
required  that  support  current  generation  web,  mobile, 
and  other  applications  to  operate  at  any  scale  .As 
need  is  a  Mother  of  innovations,  when  everyone  was 
facing  a  problem  of  Big  data  there  was  a  introduction 
of  NoSQL  by  the  researchers. 

NoSQL  is  divided  into  4  different  types  as  Key  Values, 
Documents,  Colum  and  Graphs.  Each  database  has  its 
own  advantages.  Selection  of  type  is  depending  on  the 
type  of  application  and  its  requirement.  Some  of  the 
advantage  of  NoSQL  which  make  it  powerful  is 
Horizontal  scaling,  Demoralization,  Replication. 

NoSQL  gives  the  benefit  of  time  over  relational 
databases’  join  queries  by  providing  a  feature  of  simple 
graph  traversal  operation  queries.  We  can  store 
frequently  required  information  in  one  table  in  NoSQL 
with  its  horizontal  Scaling  property.  NoSQL  also  able 
to  handle  all  the  table  Joins  at  application  level. 
Another  feature  of  NoSQL  is  Data  repetition  is 
acceptable.  This  feature  of  NoSQL  helps  to  improve 
efficiency  and  execution  speed  of  query.  However 
tables  are  not  related  in  NoSQL  so  we  need  to  very 
careful  and  needs  to  synchronize  the  data  while 


updating  the  Tables. 

Markus  Winand  wrote  in  his  blog  that  Many  NoSQL 
systems  claimed  to  solve  all  performance  problems  by 
feature  of  horizontal  scalability.  Though  for  the 
current  generation  database  scalability  is  limited  to 
write  operations  and  is  accomplished  with  eventual 
consistency  model.  A  strict  consistency  model  of  SQL 
databases  make  slows  down  write  operations,  still  that 
does  not  necessarily  imply  bad  throughput  [19]. 

Migration  of  Data  between  SQL  and  NoSQL  depending 
on  the  style  of  NoSQL  database.  Relational  database 
expert  Chris  Bird  observed  syntactical  difference 
NoSQL  and  SQL,  Also  found  that  hardness  for 
migration  like  some  mental  gymnastics  requirement  for 
new  users  of  NoSQL.  Hence  the  process  for  migration 
is  depend  on  which  technology  of  NoSQL  technology 
going  to  be  selected  for  use.  Some  researchers  had 
tried  to  cover  the  gap  between  RDB  and  GDB  .Graph 
Databases  are  able  to  represent  as  graph  of  any  kind  of 
information,  naturally  accommodate  changes  in 
data[15].  Cypher  Gremlin,  are  examples  graph  data 
query  languages. 

Some  noSQL  db  doesn't  support  range  queries  or  joins, 
users  are  restricted  in  what  user  can  view  and  how  fast 
user  can  view[10].  An  Oracal  white  paper  note  that 
Choice  of  new  system  is  an  exciting,  strategic  business 
activity  that  usually  entails  working  with  new 
technologies,  suppliers,  and  opportunities  under  the 
head  of  successful  data  migration  in  October  2011.  It  is 
also  observed  that  without  a  sufficient  knowledge  of 
both  source  and  target,  data  migration  can  cause  major 
or  minor  or  any  hidden  legacy  problems,  which  is 
nothing  but  playing  with  risk.  [17]. 

It  is  usually  assumed  that  target  of  migration  knows  the 
existing  systems  and  support  a  same  structure. 
However  NoSql  Query  Stucture  is  harder  to  gain  user 
acceptance  .  Researcher  Patricia  Cavoto  observed  in 
his  research  work  that  Complex  Structer  of  relational 
DB  cause  the  complexity  can  be  while  performing  a 
data  analysis  in  RDB.  Also  the  relational  models  are 
not  as  flexible  as  the  graph  model  for  data  analysis[ll] 
.some  researchers  found  that  relational  database  might 
require  very  sophisticated  and  expensive  operations 
also  needs  complex  join  operations  to  fetch  required 
results  in  RDBMS  which  affect  the  performance  and 
efficiency  deterioration  [15,16], 

In  concern  with  rethinking  and  remodeling  of  data 
migration  which  is  essential  part  of  almost  every 
organization,  While  the  migration  of  data  from  SQL 
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between  NoSQL  developers  need  to  think  about  how 
to  represent  your  existing  model  in  new  database[18]  it 
means  mechanism  for  storage  and  retrieval  of  data  are 
designed  in  different  way  in  both  databases. 


Figure  1:  sql-vs-nosql-databases[21] 

Power  of  SQL: 

ACID  compliancy:  ACID  property  is  the  best  power  of 
SQL  which  reduces  anomalies  and  protects  the 
integrity  database  . 

Security:  In  Handling  of  data  in  DBMSs  security  of 
data  is  important  .In  security  concerns  NoSQL 
databases  loose  his  security  by  allowing  permissions 
or  access  control  in  these  systems  are  to  be  provided  it 
means  it  lacks  the  with  the  respect  to  Security  as 
compare  to  SQL. 

Limitations  of  SQL: 

Challenge  of  scalability  and  elasticity  achievement  is  a 
huge  related  to  the  relational  databases  .  Researcher 
also  found  that  a  small  change  to  one  table  can  cause 
changes  across  the  system  [20]. It  is  observe  that 
response  time  of  an  SQL  query  changes  with  many 
factors.  One  of  them  is  volume  and  Scalability.  These 
factors  are  also  related  to  dependency  of  performance 
the  data  volume  [19] 

One  another  limitation  of  SQL  is  fixed  set  of  columns 
which  is  not  suitable  for  big  data  and  hence  there  is  a 
demand  for  non-relational  databases.  Leonardo  Rocha 
and  et  ,al;  point  out  that  Challenges  in  Migration: 
Recontruction  of  Module 

Conclusion: 

From  the  survey  we  can  conclude  that  - 

a)  existing  softwares  not  only  large  and  but  also 
medium  scale  software  applications  are  relye  on 
rlational  database. 

b)  Due  to  the  variation  in  synatatical  and  storage 
structure  of  SQL  and  NoSQL  thrir  is  problem  of 
migration. 

As  amount  of  data  stored  in  SQL  has  a  impact  on  its 


performance,  i.e  Data  stored  and  the  performance  are 
inversely  proportional  to  each  other.  Effect  of  this  is  a 
query  becomes  slower.  But  can  we  improve  this  ratio 
by  giving  a  power  of  Horizontal  Scaling  to  SQL  of 
NoSQL  is  the  key  questions  when  discussing  dataset 
scalability. 
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